Self-Organizing Maps with Eligibility Traces: Unsupervised Control-Learning in Autonomous Robotic Systems
نویسندگان
چکیده
This paper presents the application of a connectionist control-learning system to autonomous mini-robots. These robots are specially designed and equipped for learning easily deened yet challenging control tasks. The system must learn despite noisy, low-grain sensory input and uncertain interactions between motor commands and eeects in the world. The system's design is severely constrained by the computing power and memory available on board the mini-robots and the on-board training time is limited by the short life of the battery. The connectionist system proposed ts into the Self-Organizing Neural Network with Eligibility Traces (SONNET) paradigm proposed by Hougen 7]. SONNET systems are capable of unsupervised learning of input space distribution (partitioning) and output responses in temporal domains. These systems are based on the well-known, self-organizing topological feature maps of Kohonen. They are augmented for response learning by the addition of eligibility traces. This combination of features allows these systems to solve diicult temporal credit-assignment problems rapidly through the use of information sharing in input and output neighborhoods.
منابع مشابه
Self-Organizing Feature Maps for Modeling and Control of Robotic Manipulators
This paper presents a review of self-organizing feature maps (SOFMs), in particular, those based on the Kohonen algorithm, applied to adaptive modeling and control of robotic manipulators. Through a number of references we show how SOFMs can learn nonlinear input–output mappings needed to control robotic manipulators, thereby coping with important robotic issues such as the excess degrees of fr...
متن کاملPartitioning Input Space for Control-Learning
This paper considers the eeect of input-space partitioning on reinforcement learning for control. In many such learning systems, the input space is partitioned by the system designer. However, input-space partitioning could be learned. Our objective is to compare learned and programmed input-space partitionings in terms of the overall system learning speed and proociency achieved. We present a ...
متن کاملImproving Security for SCADA Sensor Networks with Reputation Systems and Self-Organizing Maps
The reliable operation of modern infrastructures depends on computerized systems and Supervisory Control and Data Acquisition (SCADA) systems, which are also based on the data obtained from sensor networks. The inherent limitations of the sensor devices make them extremely vulnerable to cyberwarfare/cyberterrorism attacks. In this paper, we propose a reputation system enhanced with distributed ...
متن کاملUsing Sliding Mode Controller and Eligibility Traces for Controlling the Blood Glucose in Diabetic Patients at the Presence of Fault
Some people suffering from diabetes use insulin injection pumps to control the blood glucose level. Sometimes, the fault may occur in the sensor or actuator of these pumps. The main objective of this paper is controlling the blood glucose level at the desired level and fault-tolerant control of these injection pumps. To this end, the eligibility traces algorithm is combined with the sliding mod...
متن کاملSteel Consumption Forecasting Using Nonlinear Pattern Recognition Model Based on Self-Organizing Maps
Steel consumption is a critical factor affecting pricing decisions and a key element to achieve sustainable industrial development. Forecasting future trends of steel consumption based on analysis of nonlinear patterns using artificial intelligence (AI) techniques is the main purpose of this paper. Because there are several features affecting target variable which make the analysis of relations...
متن کامل